Enriching the Spanish EuroWordNet by Collocations
نویسندگان
چکیده
Collocations constitute an important type of syntagmatic information whose introduction into WordNets has not yet been addressed. The goal of our work is the integration of the collocational material for the field of emotion nouns encoded in the DIccionario de colocaciones del español (DICE) in terms of Lexical Functions into the Spanish part of the EuroWordNet (SpEWN). Two features of collocations are decisive in connection with their representation in SpEWN: (i) they are variant-specific rather than synset-specific and (ii) depending on the degree of their idiosyncrasy, they may be generalized to a certain degree. These features are accounted for by introducing new structures into the SpEWN. These new structures are compatible with the general design principles of the EWN. Given that SpEWN and DICE reveal a diverging distinction of the senses of lexical items, prior to the introduction of collocations from DICE into SpEWN, the senses of the elements of collocations in SpEWN and DICE are aligned.
منابع مشابه
Making sense of collocations q
Lexico-semantic collocations (LSCs) are a prominent type of multiword expressions. Over the last decade, the automatic compilation of LSCs from text corpora has been addressed in a significant number of works. However, very often, the output of an LSC-extraction program is a plain list of LSCs. Being useful as raw material for dictionary construction, plain lists of LSCs are of a rather limited...
متن کاملQuantitative Comparison of Homonymy in Spanish EuroWordNet and Traditional Dictionaries
A quantitative comparative study of homonymy in four well-known electronic Spanish dictionaries—EuroWordNet and three traditional dictionaries—is presented. It is shown that though structuring of word senses is quite different in all dictionaries under comparison, EuroWordNet differs from the traditional dictionaries much more than these differ from each other. It is also shown that the orderin...
متن کاملMonolingual and bilingual dictionary approaches to the enrichment of the Spanish WordNet with adjectives
We report on two different approaches to the incorporation of adjectives in Spanish WordNet based on automatic extraction techniques using EuroWordNet and machine-readable dictionaries. We show that a monolingual dictionary approach enables to exploit relations between different parts of speech and enrich the internal structure of the Spanish WordNet, while the methods based on bilingual dictio...
متن کاملEnriching EWN with Syntagmatic Information by Means of WSD
Word Sense Disambiguation confronts with the lack of syntagmatic information associated to word senses. In the present work we propose a method for the enrichment of EuroWordNet with syntagmatic information, by means of the WSD process itself. We consider that an ambiguous occurrence drastically reduces its ambiguity when considered together with the words it establishes syntactic relations in ...
متن کاملMultilingual Ontology-based User Profile Enrichment
In this paper, we discuss the possibility of enriching user profiles with multilingual information. Nowadays, the English language is the de facto standard language of commerce and science, however users can speak and interact also in other languages. This brings up the need of enriching the user profiles with multilingual information. Therefore, we propose to combine ontology-based user modeli...
متن کامل